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RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical InformatiDn 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: /ohsi-. 

Source: I (yC(^h: 

Date Processed by STIC: Z^/Xy vaz> ^ ^ 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gOV or phone 703-306-4119 Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3,1 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www,uspto.gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 

United States Patent and Trademark OfFice, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gQv/ebc/efs/downloads/documents.htm> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7* Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202 
Or 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Phiza Two, 
201 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room lB03-Maih-oom, Crystal Plaza Two, 201 1 South Clark Place, Arlington. VA 22202 



Revised 01/29/2002 



Raw Sequence JListing Error Summary . 



SERIAL NUMBER: 



ERROR DETECTED SUGGESTED CORRECTION 
ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BV PTO SOFTWARE 

I W rapped Nucldcs The numberAext at Ihe end of each line ••wrapped" down to the next line. This may occur if yourTlle 

Wrapped Aminos - was retrieved in a word processor after creating it Please adjust your right margin to .3; this will 
, prevent '•wrapping." 



_lnvalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 

Misaligned Amino The numbering under each 5*^ amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 



Numbering 
4- ^Non-ASCn 

5 Variable Length 



_PatcntIn 2.0 
"bug" 



The submitted file was not saved in ASCII(E)OS) text, as rcqtflrcd by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text 

Sequcnce(s ) c ontain n's or Xaa's representing more than one residue. Per Sequence Rules, 
each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. ^ 

A "bug" in Patcntin version 2.0 has caused the <220>-<223> section to be missing from amino acid 

scqucnces(s) . Normally, PatenLin would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

7 Skipped Sequences Scqucncc(s) missing. If intentional, please insert the following lines for each skipped sequence: 

(OLD RULES) (2) INFORMATION FOR SEQ ED NO:X: (insert SEQ ED NO where "X" is shown) 

(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under \}\is heading) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to Include Uic skipped sequences. 

8 Skipped Sequences Sequcncc(s) ______ missing. If intentional, please insert the following lines for each skipped sequence. 

(NEW RULES) <2 1 0> sequence id number 
<400> sequence id number 
000 



_Usc of n's or Xaa's 
(NEW RULES) 



10 Invalid <213> 

Response 

/' 

11 *^ Useof<220> 



12^ Patentin 2.0 

"bug" 



Use of n*s and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's arc present 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses arc: Unknown, Artificial Sequence, or 
scientific name (Genus^species). <220>-<223> section is required when <213> response is Uijlaiown or 
is Artificial Sequence 

Scquencc(s) ^ / missing the <220> "Feature" and associated numeric identifiers and responses. 
Use of <220> to <223> is MANDATORY if <2I3> "Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> section. 
(See 'Tcdend Register,*' 06A)1/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use **Copy to Disk" function of Patentin version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use 'Tile Manager** or any other manual means to copy file toiloppy disk. 
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OC'es Not Comply 
Corrected Diskette Needed 




OIPE 



RAW SEQUENCE LISTING DATE: 02/15/2002 

PATENT APPLICATION: US/10/057 / 834 TIME: 14:10:04 

Input Set : A:\Arcd358.app 

Output Set: N:\CRF3\02152002\J057834.raw 

3 <110> APPLICANT: RATAIN, MARK J. 

4 INNOCENTI, FEDERICO 

5 DAS , SOMA 

6 IYER, LALITHA 

7 SAWYER, MICHAEL 

9 <120> TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR OPTIMIZING UGT2B7 SUBSTRATE 
DOSINGS AND FOR 

10 PREDICTING UGT2B7 SUBSTRATE TOXICITY 

12 <130> FILE REFERENCE: ARCD:358US 
C--> 14 <140> CURRENT APPLICATION NUMBER: US/10/057,834 
C--> 14 <141> CURRENT FILING DATE: 2002-01-25 

14 <150> PRIOR APPLICATION NUMBER: UNKNOWN 

15 <151> PRIOR FILING DATE: 2002-01-25 

W--> 17 <140> CURRENT APPLICATION NUMBER: 60/264,534 
C--> 18 <141> CURRENT FILING DATE: 2001-01-26 

21 <160> NUMBER OF SEQ ID NOS : 78 

2 3 <170> SOFTWARE: Patentin Ver. 2.1 

25 <210> SEQ ID NO: 1 

26 <211> LENGTH: 1991 

27 <212> TYPE: DNA 

28 <213> ORGANISM: Homo sapiens 



30 <220> FEATURE: 

31 <221> NAME/KEY 

32 <222> LOCATION 
34 <40 0> SEQUENCE 



CDS 

(151) . . (1740) 
1 

35 catgctcaga ctgttgattt aatgatattg tatgtacttt gacttataag ggttacattt 60 
37 taacttcttg gctaatttat ctttggacat aaccatgaga aatgacagaa aggaacagca 120 
39 actggaaaac aagcattgca ttgcaccagg atg tot gtg aaa tgg act tea gta 174 
4 0 Met Ser Val Lys Trp Thr Ser Val 

41 15 

43 att ttg eta ata caa ctg age ttt tgc ttt age tet ggg aat tgt gga 222 

44 lie Leu Leu lie Gin Leu Ser Phe Cys Phe Ser Ser Gly Asn Cys Gly 

45 10 15 20 

47 aag gtg ctg gtg tgg gca gca gaa tac age cat tgg atg aat ata aag 270 

4 8 Lys Val Leu Val Trp Ala Ala Glu Tyr Ser His Trp Met Asn lie Lys 
49 25 30 35 40 

51 aea ate etg gat gag ett att cag aga ggt eat gag gtg aet gta ctg 318 

52 Thr lie Leu Asp Glu Leu lie Gin Arg Gly His Glu Val Thr Val Leu 

53 45 50 55 

55 gca tet tea get tee att ctt ttt gat ccc aac aac tea tee get ctt 366 

5 6 Ala Ser Ser Ala Ser lie Leu Phe Asp Pro Asn Asn Ser Ser Ala Leu 
57 60 65 70 

59 aaa att gaa att tat ccc aca tet tta aet aaa aet gag ttg gag aat 414 

60 Lys lie Glu lie Tyr Pro Thr Ser Leu Thr Lys Thr Glu Leu Glu Asn 
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RAW SEQUENCE LISTING DATE: 02/15/2002 

PATENT APPLICATION: US/10/057,8 34 TIME: 14:10:04 



Input Set : A:\Arcd358.app 

Output Set: N:\CRF3\02152002\J057834.raw 
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RAW SEQUENCE LISTING DATE: 02/15/2002 

PATENT APPLICATION: US/10/0 57,834 TIME: 14:10:04 

Input Set : A:\Arcd3 58.app 

Output Set: N:\CRF3\02152002\J057834.raw 
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Gly 


Lys 


Asn 


Asp 


















--> 173 










525 










530 
















175 


tggaaaacct gataggtgag actacttcag 


tttattccag 


caagaaagat tgtgatgcaa 


1820 


177 


gatttctttc ttcctgagac aaaaaaaaaa 


aaaagaaaaa 


aaaatctttt caaaatttac 


1880 


179 


tttgtcaaat aaaaatttgt ttttcagaga 


tttaccaccc 


agttcatggt tagaaatatt 


1940 


181 


ttgtggcaat gaagaaaaca ctacggaaaa 


taaaaaataa 


gataaagcct t 




1991 


184 


<210> SEQ ID NO: 


2 


























185 


<211> LENGTH 


: 529 


























186 


<212> TYPE: 


PRT 




























187 


<213> ORGANISM: 


Homo sapiens 




















189 


<400> SEQUENCE: 


2 


























190 


Met 


Ser 


Val 


Lys 


Trp 


Thr 


Ser 


Val 


He 


Leu 


Leu 


He 


Gin 


Leu 


Ser 


Phe 




191 


1 








5 










10 










15 






192 


Cys 


Phe 


Ser 


Ser 


Gly 


Asn 


Cys 


Gly 


Lys 


Val 


Leu 


Val 


Trp 


Ala 


Ala 


Glu 




193 








20 










25 










30 
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194 


J. y X 


Ser 


His 


± J. |j 


Met 


Asn 


He 


Lys 


Thr 


He 


Leu 




CZl n 
Vjt X u. 


XjcJU. 


T 1 
X X t; 


VjX 11 


195 






35 










40 










45 








196 


Airg" 


Gly 


His 


Glu 


Val 


Thr 


Val 


Leu 


Ala 


Ser 


C Q 


Ala 


Ser 


He 


Leu 


Phe 


197 




50 










55 










60 










198 


Asp 


Piro 


Asn 


Asn 


Ser 


Ser 


Ala 


Leu 


Lys 


He 


Glu 


He 


T VT 

1 y X 


Pro 


Thr 


Ser 


199 


6 5 










70 










7 S 










80 


200 


Leu 


Thir 


Lys 


Thr 


Glu 


Leu 


Glu 


Asn 


Phe 


He 


Met 


Gin 


Gin 


He 


Lys 


Arg 


201 










85 










90 










95 




202 


Tr*D 
J. J. ^ 


Seir 


Asp 


Leu 


Pro 


Lys 


Asp 


Thr 


Phe 


T "rr> 
X xp 




±yx 


Phe 


Ser 


Gin 


Val 


203 








100 










105 










110 






204 


Gin 


Glu 


lie 


Met 


Ser 


He 


Phe 






X Xt5 


Vt -r* 
X Ilx 


Arg 


Lys 






Lys 


205 






115 










120 










125 








206 


Asp 


Val 


Val 


Ser 


Asn 


Lys 


Lys 


Phe 


Met 


Lys 


_ 

i-iy s 


Val 


Gin 


Glu 


Ser 


rg 


207 




130 










135 










X *± \J 










208 


Phe 


Asp 


Val 


He 


Phe 


Ala 


Asp 


Ala 


He 


Phe 




v^y 13 


o cx 


VjX U. 


T.on 
xj t; u. 


XltJ LI 


209 


145 










150 










155 










160 


210 


Ala 


Glu 


Leu 


Phe 


As n 


He 


Pro 


Phe 


Val 


X y X 


Ser 


Leu 


Ser 


Phe 


Ser 


Pro 


211 










165 










17 0 










17 5 




212 


Glv 

VJ J- y 


T VT 

± Y d. 


Thr 


Phe 


Glu 


Lys 


His 


Ser 


Gly 


Gly 


Phe 


He 


Phe 


Pro 


IT X V^ 


Ser 


213 








180 










185 










190 






214 


TVT 
J. y J. 


Val 


Pro 


Val 


Val 


Met 


Ser 


Glu 


Leu 


Thr 


A art 


OXll 




X IIX 


Pho 
relics 


luc: U 


215 






195 










200 










^ U J 








216 


Glu 


Airg 


Val 


Lys 


Asn 


Met 


He 


T vr 


Val 


Leu 


lyx 


Phe 


Asp 


Phe 


xxp 


Phe 


217 




210 










215 










220 










218 


Glu 


lie 


Phe 


Asp 


Met 


Lys 


Lys 




no ^ 


f^l n 

UXli 


It lit: 


lyr 


O t- X 


vjX U. 


V Cl X 


XitiU. 


219 


225 










230 










23 5 










240 


220 


Gly 


Al*g 


Pro 


Thr 


Thr 


Leu 


o fc; X. 


V7 J_ U. 


± IIX 




vjx y 


Lys 


M-XCl 


Asp 


V a. X 


irp 


221 










245 










2 50 










2 5 5 




222 


Leu 


lie 


Arg 


Asn 


Ser 


± X p 


As n 


Phe 


Gin 


Phe 


Pro 


nx o 


Pro 


Leu 


Leu 


IrX 


223 








260 










2 6 5 










270 






224 


Asn 


Val 


Asp 


Phe 


Val 


Gly 


Gly 


Leu 


nx o 


v^y o 


T. VG 
Xjy o 


IT X u 


Ala 
r\X d 


xiy o 


XT X 


Leu 


225 






275 










280 










285 








226 




Lys 


Glu 


Met 


Glu 


Asp 


XT 


V Cl X 


VjX 11 


o tr X 


OCX 


Gly 


m n 

X ix 


Asn 


vixy 


V cl X 


227 




290 










295 










300 










228 


Val 


Val 


Phe 


Ser 


Leu 


Gly 


Ser 


Met 


Val 


Ser 


Asn 


Met 


Thr 


Glu 


Glu 


Arg 


229 


305 










310 










315 










320 


230 


Ala 


Asn 


Val 


He 


Ala 


Ser 


Ala 


Leu 


Ala 


Gin 


He 


Pro 


Gin 


Lys 


Val 


Leu 


231 










325 










330 










335 




232 


Trp 


Arg 


Phe 


Asp 


Gly 


Asn 


Lys 


Pro 


Asp 


Thr 


Leu 


Gly 


Leu 


Asn 


Thr 


Arg 


233 








340 










345 










350 






234 


Leu 


Tyr 


Lys 


Trp 


He 


Pro 


Gin 


Asn 


Asp 


Leu 


Leu 


Gly 


His 


Pro 


Lys 


Thr 


235 






355 










360 










365 








236 


Arg 


Ala 


Phe 


He 


Thr 


His 


Gly 


Gly 


Ala 


Asn 


Gly 


He 


Tyr 


Glu 


Ala 


He 


237 




370 










375 










380 










238 


Tyr 


His 


Gly 


He 


Pro 


Met 


Val 


Gly 


He 


Pro 


Leu 


Phe 


Ala 


Asp 


Gin 


Pro 


239 


385 










390 










395 










400 


240 


Asp 


Asn 


He 


Ala 


His 


Met 


Lys 


Ala 


Arg 


Gly 


Ala 


Ala 


Val 


Arg 


Val 


Asp 


241 










405 










410 










415 




242 


Phe 


Asn 


Thr 


Met 


Ser 


Ser 


Thr 


Asp 


Leu 


Leu 


Asn 


Ala 


Leu 


Lys 


Arg 


Val 
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243 420 425 430 

244 lie Asn Asp Pro Ser Tyr Lys Glu Asn Val Met Lys Leu Ser Arg lie 

245 435 440 445 

246 Gin His Asp Gin Pro Val Lys Pro Leu Asp Arg Ala Val Phe Trp lie 

247 450 455 460 

248 Glu Phe Val Met Arg His Lys Gly Ala Lys His Leu Arg Val Ala Ala 

249 465 470 475 480 

250 His Asp Leu Thr Trp Phe Gin Tyr His Ser Leu Asp Val He Gly Phe 

251 485 490 495 

252 Leu Leu Val Cys Val Ala Thr Val He Phe He Val Thr Lys Cys Cys 

253 500 505 510 

2 54 Leu Phe Cys Phe Trp Lys Phe Ala Arg Lys Ala Lys Lys Gly Lys Asn 

255 515 520 525 

256 Asp 

260 <210> SEQ ID NO: 3 

261 <211> LENGTH: 20 

262 <212> TYPE: DMA 

263 <213> ORGANISM: Homo sapiens 

265 <400> SEQUENCE: 3 

266 gtgtcaatgg actgcagaac 20 

269 <210> SEQ ID NO: 4 

270 <211> LENGTH: 20 

271 <212> TYPE: DNA 

272 <213> ORGANISM: Homo sapiens 

274 <400> SEQUENCE: 4 

275 cctttccaca attcccagag 20 

278 <210> SEQ ID NO: 5 

279 <211> LENGTH: 20 

280 <212> TYPE: DNA 

281 <213> ORGANISM: Homo sapiens 

283 <400> SEQUENCE: 5 

284 cttggctaat ttatctttgg 20 

287 <210> SEQ ID NO: 6 

288 <211> LENGTH: 19 

289 <212> TYPE: DNA 

290 <213> ORGANISM: Homo sapiens 

292 <400> SEQUENCE: 6 

293 cccactaccc tgactttat 19 

296 <210> SEQ ID NO: 7 

297 <211> LENGTH: 20 

298 <212> TYPE: DNA 

299 <213> ORGANISM: Homo sapiens 

301 <400> SEQUENCE: 7 

302 ggacataacc atgagaaatg 20 

305 <210> SEQ ID NO: 8 

306 <211> LENGTH: 19 

307 <212> TYPE: DNA 

308 <213> ORGANISM: Homo sapiens 
310 <400> SEQUENCE: 8 
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Numeric Identifier already exists, <140> found multiple times 
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<210> 71 
<211> 20 
<212> DNA 

<213> Artificial Sequence J" 
<400> 71 

tctgagcatg tggatggcaa 



20 

Uc 4* t( ^^^^ 



